DISTRIBUTIONS OF THE LARGEST SINGULAR VALUES OF SKEW-SYMMETRIC 
RANDOM MATRICES AND THEIR APPLICATIONS TO PAIRED COMPARISONS 



Satoshi Kuriki 

The Institute of Statistical Mathematics 

10-3 Midoricho, Tachikawa, Tokyo 190-8562, Japan 

kur iki@ism. ac . j p 

Key Words: Bradley- Terry model; Complex Wishart matrix; Three-way deadlock; Tube 
method. 

ABSTRACT 

Let Ahe a real skew-symmetric Gaussian random matrix whose upper triangular elements 
are independently distributed according to the standard normal distribution. We provide 
the distribution of the largest singular value ai of A. Moreover, by acknowledging the fact 
that the largest singular value can be regarded as the maximum of a Gaussian field, we 



deduce the distribution of the standardized largest singular value ai/ ^ytT{A'A)/2. These 
distributional results are utilized in Scheffe's paired comparisons model. We propose tests 
for the hypothesis of subtractivity based on the largest singular value of the skew-symmetric 
residual matrix. Professional baseball league data are analyzed as an illustrative example. 

1. INTRODUCTION 

Let A = (ttij) he a p X p real skew-symmetric Gaussian matrix whose upper triangular 
elements CLij {1 < i < j < p) are independently distributed according to the standard normal 
distribution. The density function of A is given by 



1 1 

__ exp|--tr(A'A) } dA, dA = \{ da, 

i<j 



(27r)p(p- 

The singular value decomposition of A is given by 



-4 = ^ (Ti {u2i-lU2, - U2iU2,_i) , (2) 



1=1 
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where ci > ■ ■ ■ > > 0, t = [p/2] (the integer part of p/2), are the nonnegative singular 
values, and Ui is the ith column vector of a p x p orthogonal matrix U. In this paper, 
we derive the distributions of ui and its standardized version (^i/ erf, which are the 
largest singular values of the skew-symmetric matrices A and A/ •>/tr(A'A)/2, respectively. 

These distributional results are utilized in the analysis of paired comparisons. Suppose 
that there are m objects (treatments, stimuli, etc.) Oi, . . . , Om and that paired comparisons 
are made for all (™) pairs. For each i < j, ijij is assumed to be the observed degree of 
preference of Oj over Oj. The observation {uij) is written as an m x m skew-symmetric 
matrix by letting yji — —i/ij and yu — 0. For such data, Scheffe (1952) proposed an analysis 
of variance based on the following linear model: 

Vij = IJ-ij + Sij, Hij = {ai - aj) + -fij (1 <ij <m), (3) 

where eij {i < j) are independently distributed according to the normal distribution iV(0, a^) 
with the mean and the variance cr^. The parameters and 'fij are called the main effect 
and the interaction, respectively. When the no interaction hypothesis Hq : jij = is true, 
the model is easily interpreted since the relative preference /lij is the difference of the scores 
ai and aj. For this reason, Hq is called the hypothesis of subtractivity. In this paper, we 
propose the use of the largest singular value of the interaction estimator matrix {'Jij) as the 
test statistics for testing Hq in the following two cases: (i) the error variance is known 
or an independent estimator is available, and (ii) is unknown and no independent 
CT^ is available. The proposed tests are max-type statistics which suggest the direction of 
discrepancy with Hq when it is rejected. 

This paper is arranged as follows. In Section 2, the joint distribution of singular values 
((Ti, . . . ,at) of A, and the marginal distribution of (Xi are given. Moreover, acknowledging 
the fact that ai can be regarded as the maximum of a Gaussian random field on a manifold, 
the distribution of ai/ \J Y^i=i derived by modifying that of ai using the tube method 
(Kuriki and Takemura (2001), Takemura and Kuriki (2002), Adler and Taylor (2007)). In 
Section 3, test statistics for Hq based on the largest singular values are proposed. Further- 
more, professional baseball league data are analyzed as an illustrative example. 
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2. DISTRIBUTION OF THE LARGEST SINGULAR VALUES 
2.1. Singular value decomposition 

The set of p x p real skew-symmetric matrices is denoted by Skew{p). Let A = [aij) G 
Skew{p) be a Gaussian random matrix whose upper triangular elements aij {i < j) are 
independently distributed according to the standard normal distribution. The density of A 
is given in ([1]). The singular value decomposition ([2]) is rewritten as 

A = UD^U', D„ = diag(ai J, . . . , at J) if p = 2t, 

= diag{ai J,. .. ,at J, 0) ifp = 2t + l, (4) 

where a = {ai, . . . ,o"(), o"i > ■ ■ ■ > at > 0, t = [p/2], is a vector of nonnegative singular 

values, J = \ , and U E 0{p), the set of p x p orthogonal matrices. 

In (jlj), U is not determined uniquely as an element of 0{p) since A = {UH)D„{UHy 
holds for any H G H{p), where 

H{p) = {diag(i/i, ...,Ht)\H,e S0{2)} if p = 2t, 

= {diag(/7i, ...,Ht,e)\Hie 30(2), e = ±1} if p = 2t + 1, 

is a subgroup of 0{p). Here, SO (2) denotes the set of 2 x 2 orthogonal matrices with the de- 
terminant L Conversely, ?7 in (j3]) is defined uniquely as an element of the left quotient space 
U{p) = {UH{p) I U G 0{p)} (= 0{p)/H{p), say) when all the singular values ai,. . . ,at are 
distinct and positive, which is the case with probability 1 in our application. By introducing 
a quotient topology, U{p) becomes a manifold of the dimension p{p — l)/2 — t. 

The Jacobian of the transformation of (j4j) was given by Lemma 2 of Khatri (1965) as 

t 

dA = l[arl[i^^-a^YdadU, (5) 

1=1 i<j 

where e = p — 2t (= for p even, = 1 for j9 odd), dA = Yli^j daij, da = IlLi d'^i^ 

dU = yAy u'jdui (6) 
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with U = . Ip = \ l<i<j<p}\ {(2/i- l,2h)\l<h< t}. 

The differential form dU in ([6]) is well-defined on U (p) because dU is independent of the 
choice of {ui}, that is, 

f\ u'jdui = f\ u'jdui (7) 

holds for (ui, . . . ,Up) = {ui, . . . ,Up)H , H G H{p). (The proof of ([7]) is similar to (2) in 
Section 4.6 of James (1954), and is omitted.) Moreover, dU is invariant with respect to the 
orthogonal transformation U h-> QU ^ Q G 0{p). (The proof is similar to (3) in Section 4.6 
of James (1954), and is omitted.) The volume Vol(f/(p)) = Jij(^p^dU of the manifold U{p) 
is needed to determine the normalizing constant of the density of a. This is calculated as 
follows. 

Lemma 1 

Vol(f/(p)) = — 7 = r-TT for p even, 

nlir(p/2-^ + i)r(p/2-^ + i/2) nr=ir(V2) 



2tjj.p{p-l)/4: 

where t = [p/2]. 



for p odd, 



Proof. Similar to the proof of Theorem 5.1 of James (1954) obtaining the volume of the 
Stiefel manifold, we can prove the recurrence relation 

Yo\{U{p)) = Vo\{G{2,p)) Yo\{U{p - 2)) 

with Vol([/(2)) = 2, Vol(t/(l)) = 1, and 

r ^ 

Vol(G(2,p)) = / f\f\u'^du, 

is the volume of the oriented Grassmann manifold G{2,p) (Appendix A.l). The results follow 
from the fact that 

with = 2WVr(n/2) ((5.23) of James (1954)). I 
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2.2. Distributions of the largest singular value 

Substituting @ and ([5]) into ([1]), and integrating ([1]) with respect to U over U{p), we 
have the joint density of o"i > ■ ■ ■ > at > as 

i i i<j 

where Cp = Vol(?7(p))/(27r)P'^P~^^/^ is the normalizing constant. The integration of the joint 
density ([8]) over x > cti > ■ ■ ■ > at > yields the cumulative distribution function of 
the largest singular value P(o"i < x). Since the linkage factor in ([8]) is written as the 
Vandermonde determinant ni<j('^j^ ~ '^j) — '^^K'^i*^*~*^)i<i j<t' have 

P(a, <x)=cp[ det ar^'crr'') fl e^'/' da, 

Jx>ai>->(Tt>0 ^k=l ' k=l 

= Cp det ( f g-.V2 ^^"j 

\"'0 / l<i,j<t 

The last equality in the expression above follows from the determinental Binet-Cauchy for- 
mula ((2.1) of Krishnaiah (1976), (2.12) of Karlin and Rinott (1988)). Making a change of 
variable, we obtain the following theorem. 

Theorem 1 The distribution function of the largest singular value ai is given by 



P[ai < x) = dp det 



where t = [p/2], 



"o = -7 = — } — ,,,, „„ — ^, , ■ tor p even, 
p 2* 2P(p-i)/4 nf=ir(z/2) ^ 

forp odd. 



2P(p-i)/4 nt2r(V2) 

Remark 1 Let f/*! > ■ ■ ■ > 0* > &e the eigenvalues of a t x t central complex Wishart 
Hermitian matrix CWt{t + e — 1/2, It) with e = p — 2t. Then, it is observed that the joint 
density ofaf/2 {1 <i <t) coincides with that of (pi {I <i <t). (See (102) of James (1964)-) 
Accordingly, the marginal distribution of the largest eigenvalue of the complex Wishart matrix 
obtained by Khatri (1964) ^■^ consistent with TheoremUi 



2.3. Upper probability of the standardized largest singular value 

Assume that the linear space of p x p real skew-symmetric matrices Skew{p) is endowed 
with the metric {A,B) = tT{A'B)/2 = J2i<j ^ij^ij^ ^ — i.'^ij)^ B = ipij) G Skew{p). Let 

V{2,p) = {{h,h2) :px2 \ h[h^ = h'^h2 = 1, h[h2 = 0} 

be the set of p x 2 orthogonal matrices, that is, a Stiefel manifold. The largest singular value 
of a skew-symmetric matrix A is written as 

0"! = max h\Ah2 = max (h\Ah2 — h'^Ahi) 12 
= max ii[{hih'^ - h2h\)' A]/2 = max {H,A), 

where 

M = M{p) = {hih'2 - h2h[ I {hu h2) e V{2,p)] C Skew{p). (10) 

Let n = dim Skew [p) = p{p — l)/2. It is evident that M is a subset of the unit sphere 
{A G Skew{p) I {A, A) = 1} of Skew{p) with the dimension n — 1. Moreover, as shown 
in Appendix A.l, M is diffeomorphic to an oriented Grassmann manifold G{2,p — 2) = 
V{2,p)/SO{2) with the dimension 

d = dimM = dimG(2,p- 2) = dimV{2,p) - dim 50(2) = 2(j9- 2). 

The density ([T]) of A is rewritten as 

^exp{-l(AA)}rfA 

where n = dim Skew{p) and dA = Yli^j daij is the volume element of Skew{p) at A induced 
by the metric (■,■). This means that the distribution of A is the standard multivariate 
normal distribution in Skew{p). According to the general theory of the tube method (Kuriki 
and Takemura (2001), Takemura and Kuriki (2002)), there exist coefficients Wd+i,Wd-i, ■ ■ ., 
referred to as Weyl's geometric invariants, such that 

P{(Ti > x) = P[ max {H, A) > x 



\HeM 

[d/2] _ 

^Wd+i-2kGd+i-2k{x'^) + o{e-''^/'^), x^oo, (11) 

A:=0 
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and 



ld/2] 



W^+l-2fc-B(rf+i,2fc)/2,(n-d-l+2fc)/2(a^^), X > COS (12) 



A;=0 



hold, where 6'c > is a geometric quantity of M called the critical radius, G^{-) is the upper 
probability of the chi-square distribution with u degrees of freedom, and Ba^b{') is the upper 
probability of the beta distribution with the parameter (a, 6). 

Let C = (2P-'--'+i/2 Y{p-i-j + 1/2))^^^. t = [p/2]. From Theorem^ we have 



P{ai>x) = 1- det(C)-i det (^J^ ^P-^-j-l/2^-H2 

= 1 - det(C~^) det (^C - (^j^ (pP^'^-^'^e-^ 



l<i,j<t 



/^p-i-i-l/2g-0/2 ^ 



l<M<t 



Noting that 

POO 

we have 

P(ai > x) = tr (c-' ( r i-i/2g-</./2 \ ^ o{e~^V2^^ 

\ \J / l<i,j<t/ 

t 

= 9''9^jG2p^2^-2Mx') + o{e-^"/'), (13) 

where gij = T{p—i—j + l/2) and g''^ is the (i, j)th element of the inverse matrix of {gij)i<i,j<t- 
Comparing (fT3|) and (fTTj) . we obtain the theorem below. 

Theorem 2 When p > 4, the upper probability of the standardized largest singular value is 
given by 

P I , > ^ I = 5Z ^'^^'^ B{2p-2i-2j+l)/2,{n-2p+2i+2j-l)/2{x^), X > 1/^2, (14) 

with n = p{p — l)/2. 
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Proof. Weyl's geometric invariants in (HI]) are given by Wd+i-2k = W2p^z^2k = I]i+j=fc+2 9'^9ij- 
Since this asymptotic expansion is uniquely represented, we have f|T4l) from f|T2|l . The critical 
radius 6c is proved to be 7r/4 in Appendix A. 2. | 



Lemma 2 The {i,j)th element of the inverse matrix of {gij)i<i,j<t with gij = r{p — i — j + 
1/2), t = [p/2], is explicitly given as follows. 



9- 



r(t + 1 - 1) r{t + e + 1/2 - i) r(t + 1 - j) r(t + e + 1/2 - j) 

min(i,j) 

X 

k=l 



T{t + l-k)T{t + e + 1/2 -k) 
^ T{t + l-k)T{j + l-k) ' 



(15) 



A sketch of the proof of Lemma [2] is provided in Appendix A. 3. 



Table I shows the upper probabilities at the critical point -P(cri/ \/^^ > 1/V2). Note 
that when p = 4 or 5, (jf/{af + cr|) > 1/2 holds with probability 1. This table shows that 
when p is not so large, the formula given in Theorem [2] provides a wide range of values for 
the upper probabilities. 

Table I. Upper probabilities at the critical point 1 / \/2. 



p 


prob. 


p 


prob. 


p 


prob. 


p 


prob. 


4 


1.0000 


8 


0.9614 


12 


0.3236 


16 


0.0048 


5 


1.0000 


9 


0.8827 


13 


0.1603 


17 


0.0009 


6 


0.9989 


10 


0.7354 


14 


0.0634 


18 


<0.0001 


7 


0.9913 


11 


0.5328 


15 


0.0197 







Remark 2 According to the Gauss-Bonnet theorem (Lemma 3.5 of Kuriki and Takemura 
(2001), Corollary 3.1 of Takemura and Kuriki (2002)), the Euler-Poincare characteristic of 
the manifold M , or equivalently that of G{2,p — 2), is given by 

[d/2] t 

X{M) = x{G{2,p- 2))=2J2 ^d+i-2k = 2 a'' 9^, = 2 [p/2\. 

k=l «,i=i 
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3. APPLICATIONS TO PAIRED COMPARISONS 

3.1. Tests for the hypothesis of subtractivity 

In the paired comparisons model ([3]), we assume the side conditions ai = and 
Si lij ~ Ylij lij — 0- The least square estimators of aj and are given by 

m 

ai = ^yij/m, % = yij - (ai-aj), (16) 
i=i 

respectively. SchefTe (1952) showed that Yli<j^ij/'^'^ follows the chi-square distribution with 
(m — l)(m — 2)/2 degrees of freedom when the hypothesis of subtractivity Hq : = holds. 
Based on this property, when o"^ is known, or unknown but there exists an independent 
estimator a"^ of o"^ made from the replication of the observations, the chi-square or F statistics 
for testing Hq can be constructed. 

Let r = {'jij) and F = (%■). The ith largest nonnegative singular value is denoted by 
crj(-). In this paper, instead of Scheffe (1952) 's ANOVA described above, we propose test 
statistics ai(T)/a when cr^ is known, or cri(r)/a when cr^ is unknown. Because o"i(r) = 
if and only if F = 0, our proposed tests are consistent. The following lemma about the 
distribution of the singular values of F can be easily proved. The critical points of the 
proposed test are calculated by virtue of this lemma. 

Lemma 3 Under the null hypothesis Ho, the distribution followed by the nonzero singular 
values crj(F), z = 1, . . . , [(m — l)/2], is the same as that followed by the (m — 1) x (m — 1) 
skew- symmetric matrix A whose density is (JD with p = m — 1. 

One advantage of our proposed test is that the statistic (Xi (F) provides us with some 
suggestions about the direction of discrepancy with the null hypothesis Hq when it is rejected. 
Note that the largest singular value is ai (F) = maxc^^ c'Frf, where the maximum is taken 
over c = (ci, . . . , Cm)' and d = {di, . . . , dm)' such that 

i i i i i 

When Hq is rejected, we can examine the contrast functions c'Td such that c'Td is large. 
Although it is difficult to interpret the contrasts c'Td in general, it must be noted that 
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this class of contrasts includes an interesting subclass of contrasts, as described below. Let 
c = (ci, . . . , Cm)' with Cj = I/V2, Cj = — 1/-\/2, Q = (Z 7^ i, j), and let d = {di, . . . , dm)' 
with di = dj = I/VG, dk = —2/^/6, di = {I ^ k). Then, 

c'Td = {jij + 'jjk + 7fci)/V3 = {fiij + fijk + liki)/ VS, 

which represents a departure from the hypothesis of subtractivity among the three objects 
Oj, Oj and Ok- This was introduced in the context of the Bradley- Terry model and named 
as the three-way deadlock parameter by Hirotsu (1983). 

So far, we have considered the case where is known or there exists an independent 
estimator a^. When cr^ is unknown and no estimator of o"^ is available, testing Hq on 
the basis of model ([1]) is no longer possible. Instead, we assume a specific model for the 
interaction. The following is an analogue to the test for interaction in the two-way layout 
without replication proposed by Johnson and Graybill (1972). 

Theorem 3 For the m x m paired comparisons data {uij), assume the model 

fiij = (a, — aj) + X{cidj — diCj), A > 0, 1 < i, j < m, 

where Ci and di satisfy [T7\ ), and Sij {i < j) are independently distributed according to the 
normal distribution N{0, o"^) with cr^ unknown. Then, the likelihood ratio test statistic for 
testing the hypothesis X = is 

-1 (r) / VEEP^M?) = -1 (f ) / Vtr(rf)/2, 

that is, the standardized largest singular value ofT= {^ij) , where is given in l[TS\) . 

According to Lemma [31 the upper probability (p- value) of the likelihood ratio statistic 
proposed in Theorem [3] can be calculated by Theorem [2] within the range of Table L 

3.2. Analysis of professional baseball data 

Table II gives the score sheet of the Central League, one of Japan's professional baseball 
leagues, in 1997. For any pair of teams, the total number of games is n = 27. Let Vij 
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{1 < i, j < m = 6) he the number of games where Team i beats Team j, as indicated in the 
(z, j)th cell of Table II. There are no ties in this table, that is, Vij + rji = n. 

Table II. Score sheet of the Central League in 1997. 



Team i \ j 


1 


2 


3 


4 


5 


6 


1. Yakult 




13 


15 


19 


20 


16 


2. Yokohama 


14 




16 


13 


10 


19 


3. Hiroshima 


12 


11 




13 


12 


18 


4. Yomiuri 


8 


14 


14 




14 


13 


5. Hanshin 


7 


17 


15 


13 




10 


6. Chunichi 


11 


8 


9 


14 


17 





When we suppose that all games are independent trials, we can consider Vij to be a 
random variable following the binomial distribution Bin(n, gj^), where qij is the probability 
that Team i beats Team j. In order to analyze this data based on the model ([3]), we use the 
variance stabilizing transformation (Rao (1973), 6g.3) : 

/(g) = 2v^(sin-iv^-7r/4). 

Then, f{rij/n) approximately follows the normal distribution N{f{qij), 1). 

In league games, however, the games in each pair are sometimes arranged within a short 
time interval. Due to this game design, some serial correlations may occur. If the correlation 
is positive, the over dispersion is expected to be observed. In such a situation, /(rjj) can be 
modeled as N{f{qij),cr'^) with the variance unknown. 

The test statistics for testing the subtractivity and their p-values are as follows: the 
chi-square statistic tr(r'r)/2 = 15.765 (d.f. = 10, p-value = 0.1066), the largest singular 
value cri(r) = 3.932 (p- value = 0.0543), and the standardized largest singular value ai(T)/ 
\J^i ^i^Y ~ 0-990 (p-value = 0.0348). (The other nonzero singular value is o"2(r) = 
0.553.) In any cases, the hypothesis of subtractivity is found to be suspicious, or is rejected. 
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The maximum contrast of three-way deadlock is 



max 

i<j<k, i>j>k 



{% + Ijk + Iki) /V^ = (765 + 752 + 726) / = 2.832. 



(18) 



Figure I is the residual plot, which was introduced for the Bradley- Terry model by 
Takeuchi and Fujino (1988). In this figure, m points {y/o\Ui, y/diVi) {1 < i < m) are 
plotted, where Ui is the largest singular value of F (the residual matrix under the null hy- 
pothesis), and M = (mi, . . . , Um)' and v = {vi, . . . , Vm)' are the eigenvectors that correspond 
to oi. This plot is based on the two-rank approximation F ^ ai{uv' — vu'). This idea of 
plotting a skew-symmetric matrix by two-rank approximation originates from Gower (1977). 
It can be observed that the triplet teams 6, 5 and 2, which gives the maximum contrast of 
three-way deadlock, form a large triangle around the origin in the counterclockwise direction. 

Note that if F = ai{uv' — vu') holds exactly, the signed area Sijk of the triangle with 
the apexes (a/oTm;, i/oTf/) {I = k) in the counterclockwise direction satisfies 2Sijk/\/S = 
(7jj + Tj-fc + 7fci) / a/3. In our example, 2Sq52/V^ = 2.839, which is very close to the maximum 



value 2.832 in 



*2 



1 ■■ 



* 3 



* 5 



* 4 



-1 







1 



6 * 



Figure I. Residual plot by two-rank approximation. 
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APPENDIX 

A.l. The diffeomorphism between M and G{2,p — 2) 

Define the group action of SO{p) on the space Skew{p) by X H- HXH' G Skew{p) for 
X G Skew{p), H G SO{p). This action is obviously of C°°. Let 

[j o\ f 1 

Xo= \ G Skew{p), J 

\o oy V-i 

Then, the orbit that Xq belongs to is given by 

SO{p){Xo) = {HXqH' g Skew{p) \ H G SO{p)} 

= |/ii/i2 — /12/ii I h[hi = h'2h2 = 1, h[h2 = O}, 
which is the index manifold M in (|T0|) . The isotropy subgroup of Xq is 

50(p)xo = {^^ G SO{p) I Moi/' = Xo} = {He SO{p) I Mo = XoH} 
'Hi 

//2^ 

= SO [2) X S0{p-2), say. 

Let SO{p) / SO{p)xq = {H SO{p)xo \ H G SO{p)} be the quotient space endowed with the 
quotient topology. Define a map / : SO{p)/ SO{p)xq — ?■ SO{p){Xq) by f{HSO{p)xa) = 
HXqH'. Then, according to Theorem 4.3 and Corollary 4.4 of Kawakubo (1991), / gives a 
embedding from SO{p) / SO{p)xo to S'0(p)(Xo) (endowed with the relative topology). 
That is, SO{j))/SO{j))x^ = SO{p){Xo) = M (C°°-diffeomorphism). 

SOijp) / SO{j))xo = SO{p) / {S0{2) X S0{j) — 2)) is called the oriented Grassmann manifold 
G(2,p-2). 

A. 2. The critical radius of M 

Let t = {f)i<i<d, d = 2{p — 2), denote the local coordinates of the submanifold M in ffTOj) 
of Skew{p) so that an element of M is written as = 0(t) = hih2 — h2h[ = HJH', where 

1' 



Hi G S0{2), H2 G S0{p-2) 



H = H{t) = {hi,h2)eV{2,p), J 
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The critical radius 9c of M is given by 

t/t {i-{ci>A)Y ' 

where : Skew{p) — > Skew{p) is the orthogonal projection onto the subspace spanned 
by {4>, 01, ... , (f>d}, (pi = dcp/df (Lemma 3.1 of Kuriki and Takemura (2001), Lemma 2.1 of 
Takemura and Kuriki (2002)). Recall that Skew{p) is a linear space of p x p real skew- 
symmetric matrices endowed with the metric {A,B) — tr{A'B)/2. 

Let G = (Qij) = and G'^ = {g'^). Noting that (0,0) = tr(0'0)/2 = 1 and 

hence (0, 0i) = tr(0'0i)/2 = 0, we see that 

(0,P40)> = ^{tr(0'0)}' + ^ J] tr(0'0,)tr(0'0,)^^^-. 

Moreover, 

{tr(0'0)}2 = {iT[HJ'H'HJH')Y = {tr(i?Ji?'J)}' 
with H^H{t),R^ H'H, and 

tr(0'0i) = tr {H J' H' {Hi J H' + Ji/^)) = -2tr {j R' J H' Hi) 

with = dH/df. Note that i? = if'i? is a 2 x 2 real matrix such that the absolute values 
of the eigenvalues are less than or equal to 1, and = if and only if i? G SO {2). Since 
H'Hi is skew-symmetric, we can put Hi — hiHJ + HCi with hi a scalar, H a. p x (p — 2) 
matrix such that (if, H) is p x p orthogonal, and Q a p x 2 matrix. Therefore, 

tr(0'0i) = -2tr {J R' J H'{biH J + HCi)} = -2tr{JR' JH'Hd). 

On the other hand, as 

9ij = 0i) = 'tr(0'.0j)/2 

= tr{{HiJ'H' + HJ'H'i){HjJH' + HJH'^))/2 

= ti:(H'iHj) - tr(H'HiJH'HjJ) = {tr(C;Q) + 26^6^} - 2bibj 

= tr(c;c,). 
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we have 



J]]tr(0Vi)tr(0'0j)c/^^' = Ati{{jR'JH'H) {jR'JH'll)'} 



4tT{R'JH'{I - HH')HJ'R] = 4tr(i?i?') + Atr {RR' J RR' J). 



After summarizing the above, and conducting some further calculations, we obtain 

1 - {tT{RJR'J)}y4: - tT{RR') - tT{RR'JRR'J) 



cot 6c = sup 

^^50(2) 



sup n — 



1 + tr(i?Ji?'J)/2)2 



R^S'0{2) 

where R = (rij)i<ij<2. 

From the expression above, it holds cot^ ^ 1 obviously. On the other hand, when 
p > 4, R = H'H can be a zero matrix, from which cot^ 6'c = 1 or 6'c = 7r/4 follows. 

A. 3. A sketch of the proof of Lemma [2] 

For 5 > -1, let G = {T{S + 2t-i- j + 1))^^. .^^. For 1 < A; < let Bk = (&fc,^i)l<^J<^ 
be an upper band matrix with the (i, j)th element 

1 ifi=j, 
bk,ij = •{ —{5 + t — i) if i + 1 = J and i <t — k, 
otherwise. 

It can be confirmed that -B^-i ■ ■ ■ B2B1G = ETD, where T = (tij), 

if ^ < J, 



is a lower triangular matrix, and 



D = diag(r(5 + t-t + E = diag((t - 



<i<t 
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The product of becomes an upper triangular matrix Bt^i ■ ■ ■ -B2-B1 = B = (bij) with 



The inverse matrix of T becomes T ^ = {f^) with = {—ly^Hij. 

Then, the inverse matrix of G is calculated as = D^^T^^E^^B. Setting 6 = e — 1/2 
yields flT5|l . 

Further, note that 



This identity f ll9p provides another proof that dp in dH]) , Theorem [H is actually the normal- 
izing constant. 
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